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At the LHC combinatorics make it unlikely that we will be able to observe stop pair production 
with a decay to a semi-leptonic top pair and missing energy for generic supersymmetric mass spectra. 
Using a Standard-Model top tagger on fully hadronic top decays we can not only extract the stop 
signal but also measure the top momentum. To illustrate the promise of tagging tops with moderate 
boost we include a detailed discussion of our HEPTopTagger algorithm. 

I. INTRODUCTION 

Searches for top squarks at hadron colhders aim at a fundamental questions of electroweak symmetry breaking 
— if the Higgs boson should be a fundamental scalar, how can its mass be stabilized? In particular, is the Higgs 
mass protected by some symmetry? Such symmetries typically predict the existence of a top partner, like in 
supersymmetric or little Higgs models [Ull]. In such a case, studying the properties of top partners allows us 
to unravel the nature of such an underlying fundamental symmetry protecting the fundamental Higgs mass at 
the scale of electroweak symmetry breaking. 

At the Tevatron, low-mass stop searches look for loop-induced stop decays [3] to charm quarks and the 
lightest neutralino [3]. Increasing the stop mass makes it more promising to look for decays to a bottom jet and 
the lightest chargino [S] , a final state irreducible from a leptonic top decay. Finally, if the stop becomes heavier 
and the strong decay in a top quark and a gluino is not yet kinematically allowed, the stop can decay into a 
top quark and the lightest neutralino [3_. This final state has the advantage that at least hadronic top quarks 
we might be able to fully reconstruct, which puts us into a promising position to study angular correlations 
in the stop pair final state. Fully hadronic top pairs from stop production are studied in the CMS TDR 
Section 13.12, but with the requirement of an additional lepton pair from the stop decays. Including this lepton 
essentially removes all QCD backgrounds. In this analysis we will show that such a lepton is not needed once 
we apply an efficient identification of boosted tops. 

There have been several suggestions as to what we might be able to say about the nature of the stop based on 
a momentum reconstruction of its visible decay products [71 [8] ; however, to date there exists no experimentally 
confirmed analysis which extracts hadronic or semi-leptonic top pairs plus missing energy at the LHC. This 
means that without a viable discovery channel all of those suggestions are bound to end up pure fiction in the 
era of actual LHC data. In this paper we will first convince ourselves that in spite of claims to the contrary 
there is no reason to assume that stop decays to semi-leptonic top quarks plus missing energy will be discovered 
at the LHC — in line with the state of the art of experimental simulations. We will then study the reach of 
fat-jet [5HTT] searches for purely hadronic stop decays and their potential when it comes to reconstructing for 
example the top momenta. In the Appendix we will give a long-overdue study of a hadronic top tagger based 
on the Cambridge/Aachen jet algorithm and a mass drop criterion. This HEPTopTagger (Heidelberg-Eugene- 
Paris) is designed to cover moderately boosted top quarks, as we also expect them for Standard Model processes 
at the LHC [llj. ^ 

II. STANDARD SEMI-LEPTONIC ANALYSIS 

Using semi-leptonic top decays to extract the signature 

PP ^ iiil ^ (tx?) (M) ^ (h]]xl) + (bjJXi) (brDx^) (1) 



The HEPTopTagger source code will be available from 



www . thphys . uni-heidelberg . de/-plehn/lieptoptagger 
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including four jets and missing energy from the irreducible top pair production requires a detailed analysis 
of the two-dimensional missing energy vector and its correlation with the visible momenta in the final state. 
The stop mass we assume to be 340 GeV [S], decaying with essentially 100% branching ratio to a top quark 
and a 98 GeV lightest neutralino. The leading-order production rate for stop pairs according to Pythia is 
around 3.2 pb, the next-to-leading order rate from Prospino is 5.1 pb [12 . To compare our result to the 
original analysis, in this section we do not apply the NLO corrections, i.e. a flat K factor of 1.59. For the same 
reason we normalize our top-pair sample to 550 pb instead of the approximate NNLO rate around 918 pb [T3|, 
corresponding io K = 1.67. The original semi-leptonic analysis starts from a set of acceptance cuts requiring 
exactly four jets and a charged lepton |S]: 

PTj >25GeV l^jl<4.0 Ai?jj > 0.4 

PT,i > 20 GeV \m\ < 2.5 ARjg > 0.4 . (2) 

The top-pair and W^-|-jets backgrounds can be reduced by an additional set of cuts, largely inspired by the 
usual semi-leptonic top analyses at the Tevatron. One of the four jets should be 6-tagged, with the appropriate 
efhciency of 60%. The different jets, the lepton and the missing energy vector have to be separated according 
to [8] 

minjARij < 1.5 > 125 GeV 

cos (j){pT, I Jt) > 0.7 0.8 < mm,Acb{^T,x) < 1.3 {x = ij) , (3) 

and the two reconstructed top decays have to be fulfilled [5] 

Iml"" - mt \ < 5 GeV (hadronic top) 
\m^^P — rnw\ > 40 GeV (leptonic top veto with nit constraint for p^i, [HI) ■ (4) 

The first condition identifies the hadronically decaying top while the second condition makes sure that once 
we include the entire missing energy from the leptonic top decay and the pair of neutralinos the mass of the 
reconstructed top candidate does not match the physical top mass [8]. It is possible to improve the leptonic 
top veto for example by solving the kinematical constraints for the top mass and requiring that this complex 
solution have the correct real part as well as a vanishing imaginary part |14[ . However, these details should not 
affect the final outcome of our analysis, as we will see from the discussion. In the following, this analysis setup 
we refer to as 'PW [8]. 

We simulate signal and background using Herwig [T31, Pythia [TB] and Alpgen-Pythia [T7] including 
initial and final state radiation, hadronization and underlying event. The top and stop samples we generate 
inclusively without restricted decays. For the fast detector simulation we rely on Acerdet [18], a reasonably 
reliable fast simulation of LHC detectors which should agree well with full detector simulation for the analysis 
presented here [T^ . The final results including the three leading backgrounds we show in Table [ij labelled 
'PW [1]. Gompared to the original work in Ref. [Sj we see that the signal efficiency is considerable lower, 
which is largely due to combinatorics in the reconstruction of the hadronic top and subsequent reconstruction 
hypotheses. In the next section, this will serve as the motivation to instead use a top tagger, which we know 
is best suited to automatically resolve combinatorial issues [TT| . 

Given our results we can slightly optimize the original semi-leptonic analysis: Instead of exactly four jets, we 
require a minimum of four jets to allow for example for initial state radiation. The b tag we apply to jets with 
Irjjl < 2.5. Finally, the hadronic mass reconstruction is considered successful if the three-jet invariant mass 
is within 15 GeV of the nominal top quark mass, instead of 5 GeV. Again, the results are shown in Table [ij 
labelled 'PSTZ'. For large stop masses we could consider applying a significantly stiffer cut on missing energy, 
but as we will discuss in the next section such a cut will leave us with essentially unknown detector fake rates. 

The key observable shown in Table |l] is the signal-to-background ratio S/B, which determines how well 
we need to know the theory and the systematics of the QCD backgrounds to extract the signal. Note that 
none of the analyses shown offers a clear side-bin background normalization. While the optimized analysis 
has an increased signal efficiency by almost a factor ten and a promising Gaussian statistical significance of 
S/y/B = 7.5 (for 10 fb~^), values around S/B ~ 1/7 are clearly insufficient to convincingly extract the stop 
signal in the presence of systematic and theory errors. 
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Table I: Signal and backgrounds for the semi-leptonic stop analysis. The three sets of results correspond to the analysis 
suggested in Ref. [8] including ISR/FSR, hadronization and fast detector simulation (PW), a slightly modified version 
of the same analysis including ISR/FSR, hadronization and fast detector simulation (PSTZ), and the numbers from 
Ref. [8] adjusted for all electron and muon final states, without ISR/FSR or hadronization or a complete fast detector 
simulation. All rates are given at leading order, to allow for a comparison with the original numbers in the last column. 

The background results in Table |I] should still be taken with a grain of salt. While our signal efhciencies are 
in good agreement between Pythia and Herwig (Fortran and C++), the background numbers are sensitive to 
the underlying event. We can check this effect by turning on/ofF the multi-parton interactions in HerwigH — h, 
which leads to a decrease of the background rejection by an order of magnitude. However, this does not affect 
the conclusion of this section, namely that semi-leptonic stop searches are very unlikely to be visible at the 
LHC. This is a generic statement in the sense that looking at the systematic uncertainties we need to overcome 
a relative factor of 0(200) between the stop signal and the top background rates and to our knowledge there 
is no kinematic cut which for generic mass spectra significantly improves this ratio after including detector 
smearing and fakes ^U\ . 

III. HADRONIC FAT-JET ANALYSIS 

Given that the semi-leptonic analysis shown in the last section is unlikely to work at all, an alternative strategy 
would be to search for stop pairs in purely hadronic top decays. Those would allow us to fully reconstruct the 
final state and analyze the angular correlation in detail: 

pp iJl ^ (tx?) m) ^ (fejJX?) (bjjx',) ■ (5) 

Our hadronic stop analysis is based on two tagged hadronic top quarks, using the algorithm described in the 
Appendix. Tagging W bosons in their decays to geometrically large jets [2T] has been around in the LHC 
literature for quite a while, including its applications in searches for supersymmetry [22] • Higgs tags can be 
implemented in a similar manner, and as it turns out they show the best performance [^ 11111^^123] when based 
on the purely geometric C/A jet algorithm [241 125j . Inspired by searches for very heavy resonances decaying 
to top pairs [HZj several top taggers have been developed, again in the same spirit, but based on different jet 
algorithms as well as on jet shapes [lOl dH [2H1 [29] ■ One disadvantage of most of these top taggers is that they 
are not designed to work for the kind of transverse momenta we can expect in Standard Model processes. This 
means that unlike the W and Higgs taggers [SI [3U] , top taggers might be very hard to establish experimentally. 
Following the ttH analysis [TT] we slightly refine our top tagger for moderate top boosts and apply it to this 
new challenge: extract a new-physics signal from purely hadronic final states and reconstruct its kinematics. 

For triggering we expect our signal events to pass the jets plus missing energy trigger at the LHC. To extract it 
from the backgrounds we can employ the recently developed fat-jet tools which aim at tagging a boosted top jet 
without being killed for example by combinatorics. We start by constructing jets using the Cambridge/ Aachen 
algorithm [21], implemented in Fast jet [55], with R ~ 1.5 and requiring at least two jets with 

PT,j > 200/200 GeV > 150 GeV . (6) 

Those two cuts are chosen to obtain the largest signal-to-background ratio S/B. To reduce the probability of 
fake missing energy due to detector effects we require the two-dimensional missing energy vector to be well 
separated from the jets, to avoid cases where missing energy is generated by just mis-measuring one jet. This 
should leave us with a suppression factor of 1% for fake missing energy above > 150 GeV in QCD jet events 
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Figure 1: Normalized mT2 distributions for the stop signal (m^ — 340 GeV) and the tt background, after reconstructing 
two (real of fake) hadronic top quarks. The hypothetical LSP mass we set to m^o = GeV (left) or to the correct value 



of m^o = 98 GeV (right). 



without any physical missing energy |19j . which we apply in the following. Next, we veto isolated leptons with 
Pt,i > 15 GeV, \rii\ < 2.5, requiring E}^""^ < CIE'^^ within R < 0.3 around the lepton. 

At this level we apply the top tagger described later and in the Appendix and require two tops to be identified 
and reconstructed. Finally, after requiring one b tag inside the first tagged top we construct mT2 |26j . Assuming 
we do not know the LSP mass, i.e. setting it to zero in the mT2 construction, we require 

mT2 > 250 GeV . (7) 

While in Table |ll] we will see that this cut has hardly any impact on the signal significance S/ Vi?, at least for 
small stop masses, we apply it to increase the signal-to-background ratio S/B and hence become less sensitive 
to systematic and theory errors. 

Constructing the tot2 distributions has two motivations, of which the background rejection cut might even 
be the lesser. From the two panels of Figure [l] we see that 771^2 with an assumed massless LSP is better suited 
to distinguish the stop signal from the top background. As expected. Figure [T] also shows that for larger stop 
masses this cut becomes increasingly effective. More importantly, once we know the correct value of m^o we 
can determine the stop mass from the endpoint of the mT2 distribution. Determining the uncertainties of such 
a mass measurement, however, is beyond the scope of our phenomenological analysis. Obviously, due to the 
wrong decay topology the endpoint of the tt background has nothing to do with the physical top mass, so we 
cannot use it to gauge the stop mass measurement. 

For a double Standard Model top tag the mis-tagging probability when applied to a pure QCD or M^-|-jets 
sample after our process specific cuts turns out to be (not much) below 0.1%, comparable to the numbers 



quoted in the Appendix, Table III From the first column of Table Wit i s clear that such a reduction rate is 
not sufficient. Therefore, we follow the example of the Higgs tagger [9|lllj and apply an additional b tag inside 
the main constituents of the first tagged top. Limiting this b tag to the three main constituents of one specific 
tagged top reduces the fake rate in particular from charm jets or gluons splitting into bb pairs. Assuming a 
60% tagging efficiency and a light-flavor rejection around 1/50 this will give the first top tag a mistag rate 
well below 0.1%. As it will turn out, this is sufficient to render the QCD and M^-|-jets backgrounds negligible 
compared to the tt background. Charm jets in the QCD jets sample we do not expect to be a problem. On the 
one hand, they have a 10% mis-tagging probability for our b tag, but on the other hand the will appear much 
less frequently, based for example on the reduced probability of gluon jets splitting into quarks — a factor 1 /4 
from counting quark fiavors in (7 — > alone. Last but not least, given the moderate boost of the top quarks 
we check that including a (0.1, 0.1) granularity of the detector in a lego plot has no impact on our analysis. 

The large transverse momentum of the two candidate fat jets in Eq.([6]) allows us not to worry about triggering 
on the one hand and to generate events with a sizeable efficiency — for the actual analysis this cut has little 
effect, because inside the top tagger we apply a lower cut on the transverse momentum of the reconstructed 
top > 200 GeV. We explicitly check this by lowering the acceptance cuts to prj > 100 GeV and find no 
effect on the final numbers of the analysis. 
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The different steps of our analysis are illustrated in Table [TTj for different stop masses and the leading 
backgrounds. The event numbers are normalized to NLO cross sections for the stop pair signal and the leading 
tt background. For QCD and W^+jets the normalization after cuts has many sources of mostly experimental 
uncertainty that we can as well stick to the leading order normalization. 

Stop pair signal — in contrast to for example Higgs signals strongly interacting new particles will be produced 
with sizeable rates at the LHC. For identical masses, the production rate for stop pairs is actually the smallest 
of all QCD-initiated supersymmetric processes, due to the large number of essentially degenerate light-flavor 
squarks at the LHC, the fundamental color charge of the stops, and the lack of a t-channel qq production 
process. Typical NLO cross sections for stop pair production range from 5.1 pb (m^ = 340 GeV) to 0.4 pb 
(540 GeV) and 0.15 pb (640 GeV) [12 . After requiring missing energy, two top tags and one b tags we are left 
with several fb of rate. As we can see in Figure [l] the stiff mT2 cut is not particularly efRcient, in particular 
for small stop masses, but it does give us the necessary handle to suppress the tt background to a level of 
5/5^0(1). 

Top pair background — as we know from the semi-leptonic analysis and as we can see in Table ITTl top pair 
production is the most dangerous background to stop searches. Its total rate shows a relative ermancement 
of several hundred over the signal and two physical tops can be tagged including the b jet we are requiring. 
Purely hadronic top decays are reduced by our missing energy cut in analogy to the pure QCD background, i.e. 
by a factor 1/100. Semi-leptonic top decays are more dangerous, since after one top tag the discussion in the 
Appendix shows that there is very likely a second top tag based on recoiling QCD jets. After two top tags the 
tt background is still larger than the signal. Therefore, we apply a cut on tot2, clearly distinguishing missing 
energy from two LSPs to large missing energy from one neutrino in the semi-leptonic top background. 
QCD background — just because of its sheer size QCD jet production tends to be an unsurmountable back- 
ground at the LHC. After requiring two hard jets we are still left with more than 10*" fb of rate. As discussed 
in the Appendix we cannot suppress such a rate only using the kinematic features of the top decay. The 
probabilistic treatment of fake missing energy (1/100) and one b mis-tag (1/50) give us an additional suppres- 
sion, where after two top tags we arrive below the tt background. Note that we cannot assign a mT2 survival 
probability to the QCD background, since we do not know the distribution of the detector-fake missing energy 
vector. However, because this fake missing energy will be uncorrelated with the other momenta in the event, 
just like one additional missing particle, we estimate the efficiency by the tt value around 22%. If for some 
reason QCD jet production should still pose an experimental problem there is the option of requiring a b tag 
also in the second reconstructed top jet. 

W^-f jets background — in contrast to QCD jets production this process includes actual missing energy. Techni- 
cally, we simulate this background using Alpgen [17] with four hard jets plus additional coUinear jet radiation. 
The VF-l-jets rate only exceeds the signal rate by less than a factor 100, so applying the basic cuts and requiring 
two tagged top quarks reduces it to a level we can deal with. The b tag and the additional cut on mT2 reduce 
the VF-|-jets background to a level where it is hard to predict without sufficient statistics. Irrespective of the 
details we can conclude that M^-|-jets do not pose a problem to the stop pair search. 

Z-|-jets background — because of the significantly smaller rate, the slightly lower invisible Z branching ratio and 
the sizeable probability to miss the lepton from the W decay we can safely assume that the Z+jets background 
will be as irrelevant as the W^-|-jets background after cuts. Numerically, even with too low statistics for a 
detailed analysis we see that after cuts the {Z — ^ i/i^)-fjets background is always smaller than the VF-fjets 
background by a factor 0(1/3) and hence irrelevant. 

The right columns of Table [IT] clearly show that extracting hadronic stop pairs from the different Standard 
Model backgrounds will not be a problem at all. The statistical significance is above the discovery limit already 
with an integrated luminosity of 10 fb~^. The event numbers are not huge, but a more careful statistical 
treatment for example of our crude cut will change this easily. In contrast to semi-leptonic stop de- 

cays systematics will not pose any problem either, possible complications from jet combinatorics should be 
automatically resolved by the top tagger [TT] . 

One curious feature we see once we increase the stop mass: for a constant LSP mass the increase in the cut 
efficiencies actually over-compensates the decrease in the stop production rate. This is most obvious for the 
mT2 cut shown in Figure [l] but it also holds for example for the top tagging efficiency which benefits from the 
increased stop-neutralino mass difference. Table |TT] therefore does not give a good estimate of the stop mass 
reach. To answer this question we would need to adjust the and mT2 cuts which as it stands are optimized 
for 340 GeV stops. 
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Table II: Signal (for different stop masses) and bacfcgrounds for tlie hadronic fat-jet analysis. All numbers given in fb, 
the significance is computed for 10 fb~^. The tiil and ti rates are normalized to their higher-order values [121 113j . 
Z-|-jets we simulate with the neutrino decay specified. 

Moreover, it is clear that from the endpoints of the tot2 distributions we should be able to measure the stop 
mass (or better the stop-neutralino mass difference) in this process. While making this quantitative statement 
does not require any further work, actually estimating the experimental error on stop mass measurements using 
fat jets goes far beyond what we can do in this paper. We therefore refrain from quoting any number for the 
stop mass measurements and leave it at this statement and the encouragement for a detailed experimental 
analysis including full detector simulation. For supersymmetric parameter analyses such a measurement would 
of course be hugely beneficial [311 132] ■ 

IV. OUTLOOK 

We have shown that while semi-leptonically decaying stops are unlikely to be observed at the LHC, a fat -jet 
analysis should be able to discover purely hadronically decaying stops with typical integrated luminosities of 
10 fb^^ at 14 TeV. This is true for stop masses above 340 GeV (for mLSP = 98 GeV) and extends to stop masses 
well above this range. The stop mass reach based on hadronic decays can be extended more by scaling the 
different cuts with the stop-neutralino mass difference. Moreover, our limiting factor is somewhat inefficient 
cuts to improve S/B, so we expect this result to improve significantly once modern statistical methods are 
applied. 

The dominant background after cuts and reconstruction is exclusively tt production, which we can reduce 
to the S/B ^ 1 level. QCD jet production is suppressed to a small fraction of the tt background, and y-|-jets 
backgrounds are negligible. This promising result relies on two tagged and reconstructed top quarks, which in 
turn allow us to use mT2 constructed from the top momenta and the missing energy vector. Combinatorics are 
automatically resolved by the top tagging algorithm. 

The fact that we can reconstruct the top momenta should allow the LHC to analyze in detail the nature 
of a top partner decaying to a top quark and a dark matter agent. Moreover, because of the large signal-to- 
background ratio S/B — 0(1) we will be able to use the endpoints of the mT2 distribution to measure the stop 
mass once we know the LSP mass. Determining the experimental uncertainties for this mass measurement we 
have to leave to an experimental study including a full detector simulation. 

As shown in detail in the Appendix our HEPTopTagger algorithm is not only well suited to detect stop pairs 
at the LHC. It can be tested in Standard Model top pair production and it can be applied to a large variety 
of problems where standard methods fail, for example due to jet combinatorics. In one such application, high 
multiplicities of final states from longer decay chains will be automatically resolved. In the current form the 
top tagger relies on a Cambridge/ Aachen algorithm with a mass drop criterion and a set of invariant mass 
constraints. Once we require a fat jet with pT > 200 GeV our top tagging efficiency can reach the 40% to 50% 
range for reasonably boosted tops with mis-tagging probabilities around a few per-cent. 
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Appendix A: HEPTopTagger: Boosted Tops in the Standard Model 

Top taggers are algorithms identifying top quarks inside geometrically large and massive jets. They rely on 
the way a jet algorithm combines calorimeter towers into an actual jet. An obvious limitation is the geometrical 
size of the jet which for a successful tag has to include all three main decay products of the top quark. At 
the parton level we can compute the size of the top quark from the three R distances of its main decay 
products: following the Cambridge/ Aachen algorithm [241 [25] we first identify the combination with the 
smallest ARij . The length of the second axis in the top reconstruction we obtain from combining i and j and 
computing the R distance of this vector to the third constituent. The maximum of the two R distances gives 
the approximate partonic initial size ARbjj of a C/A jet covering the main top decay products. In Figure [2] we 
first correlate this partonic top size with the transverse momentum of the top quark for a complete tt sample 
in the Standard Model. As expected, if for technical reasons we want to limit the size of the C/A fat jet to 
values below 1.5 we cannot expect to see top quarks with a partonic transverse momentum of pt ^150 GeV. 
In the right panel we show the same correlation, but after tagging the top quark as described below and based 
on the reconstructed kinematics. The lower boundaries indeed trace each other, and the main body of tagged 
Standard Model top quarks resides in the p^^t = 200 • • • 250 GeV range, correlated with ARlJj = 1 • • • 1.5. This 
result illustrates that for a Standard Model top tagger it is indeed crucial to start from a large initial jet size. 

Therefore, our tagger for Standard Model tops is based on the Cambridge/ Aachen [24ll25] jet algorithm with 
i? = 1.5, combined with a mass-drop criterion [SHU]- Because the generic pt range for the tops does not exceed 
500 GeV the granularity of the detector does not play a role, and we can optionally apply a b tag to improve 
the QCD rejection rate. Since such a subjet b tag [30] will only enter as a probabilistic factor (60%, 10%, 2%) 
for (6, c, q/g) jets we do not include it in the following discussion. Note that whenever we require a b tag in our 
actual analysis, the numbers do not yet include the (70%, 1%) improvements found for a b tag inside a boosted 
Higgs [Sg. 

The algorithm proceeds in the following steps: 
1. define a fat jet using the C/A algorithm with R = 1.5 




Figure 2: Left: partonic ARbjj vs pr distribution for a Standard Model ti sample. Right: the same correlation, but 
only for tagged top quarks and based on the reconstructed kinematic properties. 
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2. for each fat jet, find all hard subjets using a mass drop criterion: when undoing the last clustering of the 
jet j, into two subjets ji,j2 with rrij-^ > rrij^, we require rrij^ < 0.8 mj to keep ji and j2. Otherwise, we 
keep only ji. Each subjet ji we either further decompose (if rrij. > 30 GeV) or add to the list of relevant 
substructures. 

3. iterate through all pairings of three hard subjets: first, filter them with resolution -RfiUcr — 
min(0.3, Ai?jfc/2). Next, use the five hardest filtered constituents and calculate their jet mass (for less 
than five filtered constituents use all of them). Finally, select the set of three-subjet pairings with a jet 
mass closest to mt. 

4. construct exactly three subjets ji, j2, is from the five filtered constituents, ordered by pt- If the masses 
(mi2, mi3, 77123) satisfy one of the following three criteria, accept them as a top candidate: 



0.2 < arctan— < 1.3 



and R„ 




< 1 - 



< 1 



■mi23 

m23 

■mi23 



< < 

'71123 




and 



and 



m23 
TO123 

"723 
"7123 



> 0.35 



> 0.35 



(Al) 



with i?inin = 85% X mw I mt and i?max — 115% x tovk / to* . The numerical soft cutoff at 0.35 is independent 
of the masses involved and only removes QCD events. The distributions for top and QCD events we show 
in Fig.[3| 



5. finally, require the combined px of the three subjets to exceed 200 GeV. 



In step 3 of the algorithm there exist many possible criteria to choose three jets from hard subjets inside a fat 
jet. For example, we can include angular information (the W helicity angle) in the selection criterion and select 
the smallest Am* + A^yArnvK + ^/tA coSh- In that case, the tagging efficiency increases, but simultaneously the 
fake rate also increases, so to reach the best signal significance we simply select the combination with the best 
mt- This allows us to apply efficient orthogonal criteria based on the reconstructed mw and on the radiation 
pattern later. 

In step 4, the choice of mass variables shown in Figure [3] is of course not unique. In general, we know that 
in addition to the two mass constraints (771123 — ^1°^ as well as rrijk — rrf^ for one (j, k)) we can exploit one 
more mass or angular relation of the three main decay products. Our three subjets jk ignoring smearing and 
assuming pf ^ give 



"7* = "7?23 = {Pl +P2+ P3f = (Pl + V2f + {Pl +P3f + {P2 + Psf = + n^l3 + ^23 > (^2) 
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m,[GeV] m,[GeV] 

Figure 4: Number of tagged tops in the tt, QCD jets and VK+jets channels for a varying assumed top mass in the 
tagging algorithm. The actual top mass in the sample is 172.3 GeV. Shown is the number of tagged first (left) and 
second (right) tops. 



which is the surface of a sphere with radius mt in (toi2, "t-is, "^23)■ For fixed TO123 we can pick exactly two 
more variables to fully describe the kinematics: we choose m23/mi23 and arctan 77113/77112, which means that 
1^12/^123 can be derived as 




( + ( ™23 \ 
\mi2 J / V"^123/ 



(A3) 



Assuming 771123 = ™t the condition 77712 = i 15% then reads 77112/777123 = Rmin - - - -"-maxj 
we use in Eq.( |Al[ ). Note that our three mass conditions can also be written in terms of two masses and the 
W helicity angle [TUl E] , but the construction of this angle requires a boost into the W rest frame with its 
experimental challenges which we prefer to avoid. The switch from the helicity angle scheme to the pure mass 
scheme only has a negligible effect on the efficiencies computed without full detector simulation. 

Finally, in contrast to the Higgs tagger [5J [TT] the top tagger does know about the top mass when searching 
for the two mass drops. This means that we will not be able to apply a side-bin normalization. However, 



we can access side bins by changing the assumed mt as used in the algorithm, Eq.(Al), to values different 
from the top mass in the event sample. The result of such a misalignment we show in Figure |4} for the QCD 
and VF+jets background the number of tagged tops follows the typical px dependence of the jet sample. The 
lower the top mass we are looking for the more tagged tops we will find. In contrast, the top sample shows a 
clear peak when the assumed top mass in the algorithm coincides with the top mass in the sample. Towards 
larger assumed top masses the distribution shows a one-sided width around 20 GeV. Towards smaller assumed 
top masses additional QCD jets can have an increasing impact on wrongly tagged tops. Therefore, the tail is 
considerably higher. While this kind of behavior makes it unlikely that such side-bins will useful for an actual 
analysis they serve as a very useful cross check for our fat-jet methods. 

In Figure [5] we summarize the performance of the tagging algorithm described above. In the left panel, we 
show the parton-level pt of the hadronic top quarks in the ti sample, normalized to the top production rate. 
As we already know from Figure [2] this distribution drops rapidly and essentially vanishes for pr > 500 GeV. 
This is the reason why our tagging algorithm focuses on a top px range between 200 and 500 GeV. The curve 
for tagged tops follows the curve for produced tops smoothly for pT > 250 GeV. The same curve for tagged 
tops is actually included in all three panels, different just because of the normalization of the plots. The two 
curves for mis-tagged tops in the W^-|-jets and QCD sample are shown as a function of the reconstructed px of 
the top constituents in the last step of our algorithm. Again, they are normalized to the production rate at 
the LHC, so we immediately see that one top tag will not be sufficient to reduce the pure QCD background 
to the level of top pair production. The W^-f jets background, in contrast, should not pose a problem to fat -jet 
analyses, which we confirm in our actual analysis and show in Table |TTj 

In the center panel of Figurejsjwe show the fraction of tops found inside C/A distances of AR^jj < 0.9, 1.2, 1.5, 
normalized to the number of tops produced (i.e. the top production line in the left panel). As indicated in 
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P^[GeVl P^[GeV] p^[GeVl 

Figure 5: Left: number of tops, tagged tops, mis-tagged tops from QCD jets and for W+jets for 1 fb~^. Center: fraction 
of hadronic tops whose main parton-level decay products are within a C/A distance of ARbjj — 1.5, 1.2,0.9. Right: 
tagging efficiencies, normalized to the top line of the central panel. 

Figure [21 in particular in the promising Standard Model range pT,t < 300 GeV we lose the vast majority of 
events if we reduce the jet size from R = 1.5 to R = 1.2. We also show the fraction of tagged tops based on 
R = 1.5, showing an efficiency of 20% to 40% relative to all tops produced. 

In the right panel of Figure |5] we show the top tagging efficiency as a function of the reconstructed top px, 
normalized to the number of tops within ^Rbjj < 1-5 (the top line in the center panel). The first line shows 
how many of the tops end up with all main parton-level decay products inside the fat jet, as requested in step 1 
of our algorithm. There is a loss associated with this actual construction of the fat jet, because even if all 
main top decay products are close enough to end up inside a fat jet of size R = 1.5, they do not have to. For 
example, the geometric center of the fat jet can be slightly shifted, so one of the top decay products drops out. 
The second line shows the fraction of tagged tops. The shading indicates the fraction of these tops where we 
cannot establish a one-to-one connection between the three subjcts constructed in step 4 and the parton-level 
top decay products. 

To establish such a connection we compute the R distances between the three subjets and all hard partons 
in the event. We then identify the parton pairing which gives the smallest value of Ai?^^^, = AR{ji,pi)^ + 
Ai?(j2,Pj)^ + Ai?(j3,pfc)^ and check if this pairing corresponds to a top decay at parton level. If not, we assume 
that either a QCD jet might have entered the reconstruction or that QCD radiation has bent one of the top 
decay jets far away from its partonic origin. However, this rate is considerably higher than the M^-l-jets mis-tag 
rate, so these events are not dominated by continuum QCD jet production. Instead, they represent the generic 
problem of identifying partons with jets by some kind of geometric measure. In a way these tags are the tricky 
ones for low transverse momenta, while the efficiency for identifiable tags is a fairly constant 0(40%) over the 
entire pt range. 

The bottom line in terms of tagging efficiencies and mis-tagging probabilities we show in Table |III[ provided 
we find something like a fat jet a top can be tagged with an efficiency of 23% to 51%, dependent on the pt 





tt QCD W+iets 


tt QCD W+iets 




PTT[GeV] 


200 300 


200 300 




one fat jet 
two fat jets 


92200 36100 8250 4.10 • lO' 3.19 • 10^ 
40700 20300 5810 2.16 • 10^ 1.60 ■ 10^ 


100% 100% 100% 100% 100% 
44% 57% 70% 53% 50% 


relative to one fat jet 


one top tag 
two top tags 


20900 13400 4160 8.18- lO"" 1.27- 10" 
1880 1630 700 11000 233 


23% 37% 51% 2.0% 3.9% 
2.0% 4.5% 8.5% 0.027% 0.07% 
4.5% 8.0% 12% 0.05% 0.15% 


relative to one fat jet 
relative to one fat jet 
relative to two fat jets 



Table III: Number of events in 1 fb~^ and their relative tagging efficiencies for the first and the second top tag in the 
different Standard Model event samples. One top tag and one fat jet means at least one tag or fat jet. Fat jets are 
defined with pr > 200 GeV. The top quarks are produced with pr > 0, 200, 300 GeV at parton level. All rates are 
quoted at leading order. 
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Figure 6: From left to right: AT? between the reconstructed and the parton-level top quark; Apt/Pt"^ for the same 
sample; Ap/p'®"^ where p denotes the absolute value of the top 3-momentum. The solid curves correspond to the default 
Py ( > 200 GeV of the tagged top, while the dashed curves constrains the tagged top to values above p^'j > 300 GeV. 



range of the top. This variation shows that for low generated pT,t there will still be a fat jet in the tt sample, 
but this fat jet will tend to not include the top decay products, so we cannot tag a top to begin with. 

For a second top tag we first need to see another fat jet in the sample. For top pairs this will happen in 44% 
of all events, up to 70% for hard tops. However, there are two tops in the event, and there will likely be two 



fat jets. For px.t > 200 GeV the 37% tagging efficiency quoted in Table III corresponds to a 26% efficiency 
of tagging a top in a given fat jet. Based on this number we can compute the probability of tagging two tops 
in two fat jets, which gives slightly less than 4%. So in particular for low-pr tops our efficiency for a second 
top tag is higher than for the first. For the signal discussed in the main body of the paper we would need to 
fold this efficiency with the px spectrum of tops from stop decays. From the 200 GeV and 300 GeV columns 
in Table [III] we see that this will help considerably. 

For W^+jets and pure QCD jets with their generically softer QCD structure it will not be as likely to actually 
find the first fat jet in the sample. In addition, the mis-tagging rate for the first top tag after seeing a fat jet 
ranges around 2% to 4%. The efficiency for a second fat jet in the background processes is almost as large as 
for top pairs. This reflects the fact that one hard fat jet has to recoil against QCD activity which will give us 
a second fat jet. The probability of mis-tagging two tops is then roughly the first mis-tag probability squared, 
after factoring out the probabilities of finding one or two fat jets. This way, the over-all efficiencies for two 
top tags significantly enhance the signal-to-background ratio, in particular for pure QCD jets. As mentioned 
several times, this number can be improved if we ask for a b tag inside the top jet. As a last comment, our 
top tagger is optimized for low px.ti so further work and modifications should be able to increase its efficiency 
towards higher boosts. 

The last question beyond the simple top tag is how well the algorithm described in this Appendix can 
reconstruct the top momentum. In principle, our top tagging algorithm can identify the three subjets as either 
W-decay jets or the b jet. Unfortunately, even amongst the events which allow for a clear comparison of the 
partonic top decay products and the resulting subjets a fair fraction returns mi,j ^ 80 GeV on the parton level. 
This is because the invariant masses of all jet combinations reside in the same range, out of which our W mass 
window represents a sizeable fraction. To test for such effects we can take all tagged tops in the hadronic tt 
sample and compare the reconstructed top momenta to the parton-level input. In Figure |6] we first show the 
angular distance of the reconstructed top quarks from the parton-level truth. 

While there is a strong peak for Ai? < 0.5 and 95% of the events resides in the area, we also observe a long 
tail at the 10"'^ level, which is due to combinatorics or effective QCD mis-tags in the top sample. In the second 
panel we show the relative error on the transverse momentum of the tagged top {Apx = p"'^ — p^^^)- For 
around 85% of the events the mis-measurement compared to the parton-level truth stays below the 20% level. 
In the third panel we show the same for the entire 3-momentum. Again, 68% of the tops are reconstructed at 
the 10% level, while 80% are reconstructed within Ap/p ~ 20%. Obviously, all these numbers can be improved 
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if we increase the pf?'" cut on the reconstructed top for example from 200 GeV to 300 GeV. 
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